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Abstract 



. A class of one-dimensional convolutional codes will be presented. They are all MDS 

codes, i. e., have the largest distance among all one-dimensional codes of the same 
length n and overall constraint length S. Furthermore, their extended row distances 
are computed, and they increase with slope n — S. In certain cases of the algebraic 
parameters, we will also derive parity check matrices of Vandermonde type for these 
^ ' codes. Finally, cyclicity in the convolutional sense of [7] will be discussed for our class 

of codes. It will turn out that they are cyclic if and only if the field element used 
in the generator matrix has order n. This can be regarded as a generalization of the 
^ . block code case. 
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■ The main task of coding theory is the construction of powerful codes. This applies equally 

well to block codes and convolutional codes. In either case codes are required to have good 
error-correcting properties, i. e. a large distance, and an efficient decoding algorithm. 

In block coding theory this goal has been achieved best by the class of Reed-Solomon 
codes along with their efficient algebraic decoding algorithm. These codes are in particular 
MDS (maximum distance separable), meaning that they have the largest distance possible 
among all codes with the same length and dimension. On the other hand there are 
convolutional codes, and despite their frequent and successful use in engineering practice, 
their mathematical theory is still in the beginnings. The algebraic theory of this class 
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of codes has been initiated with the paper [2] of Forney and has seen a considerable 
development ever since. 

In particular, quite some efforts have been made in the area of constructing convolutional 
codes with large distance. The first group of the according papers appeared in the sev- 
enties of the last century. In ^3 El quasi-cyclic block codes have been used in order 
to construct convolutional codes with good distance. The relation between the weights 
of the block codewords and the convolutional codewords is made by the weight-retaining 
property. This topic has been resumed later on in [22; where the ideas have been used to 
construct MDS convolutional codes with (almost) arbitrary algebraic parameters. Other 
more recent attempts of constructing good convolutional codes try to impose additional 
algebraic structure on the convolutional codes themselves. In [Ij methods from algebraic 
geometry are used in order to construct convolutional codes of Goppa type. In the pa- 
per [HI system theoretic methods are used in order to analyze codes with optimal column 
distances. Finally, in ideas from the seventies have been resumed in order 

to impose a type of cyclicity on convolutional codes. The investigations of these cyclic 
convolutional codes have been continued in 011]. We will explain the notion of cyclicity 
later in Section 4 of this paper. At this moment we restrict ourselves to mentioning that 
cyclicity for convolutional codes is a more general notion than just the natural invariance 
of the code under cyclic shift. 

In the present paper we will combine the two main lines mentioned above. We will present 
a class of one-dimensional codes that are not only MDS but also have extended row 
distances increasing with slope n — 6 (where n is the length of the code and 6 the overall 
constraint length) . We will also compare the required field size needed for the construction 
with the field sizes of other constructions known in the literature. It will turn out that 
our field sizes are smaller for many parameters than what has been used before. For 
one set of parameters the field size is even only one above the theoretic minimum. In 
addition to these distance computations and field size investigations, we will also discuss 
the algebraic structure of these codes. As it turns out, for certain algebraic parameters 
the presented codes are cyclic in the sense mentioned above. In this case the codes can in 
fact be regarded as a generalization of (one-dimensional) Reed-Solomon codes. They even 
have a polynomial parity check matrix of Vandermonde type, showing that this class of 
codes are closely related to some of the codes given in P . 

The paper is organized as follows. In the rest of the introduction we will collect the 
preliminaries about convolutional codes. In Section 2 we will present the class of codes 
via their generator matrices along with their (extended row) distances and compare the 
field size to results from the literature. In Secton 3 various parity check matrices with 
Vandermonde structure are presented. In Section 4 we will introduce the notion of cyclicity 
for convolutional codes as it has been investigated in 0. We will show that in a certain 
(to be expected) case our codes are cyclic, and we will present various representations of 
the codes. We will close with some open problems. 

We end this introduction with the basic notions of convolutional coding theory. Convolu- 
tional codes are certain submodules of F[z]", where F is a finite field. Before presenting 
the definition we wish to recall that each submodule S of ¥[z]"' is free and therefore can 
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be written as 

S = imG:= {uG\ue¥[zf} 

where k is the rank of S and G G F[2;]'^^" is a matrix containing a basis of S. Hence, 
the matrix G is unique up to left multiphcation by a matrix from Glk{¥[z]). Moreover, 
by resorting to the Smith normal form one easily shows that G is right-invertible, i. e., 
GG = Ifc for some matrix G S F[2:]"^'^, if and only if the submodule imG is a direct 
summand of the module F[z]". This in turn is equivalent to the existence of a matrix 
H G F[z]('^-^)^" such that imG = kev := {v G F[z]" | vH^ = 0}. Using the theory of 
polynomial matrices it is easily seen that we may assume H to be right-invertible. Then it 
is unique up to left multiplication by a matrix from Gln-k{¥[z]). Obviously, the matrix H 
generates the dual module, i. e., imH = 5"*" := {w G ¥[z]"' \ wv^ = for all v G S}. 

This makes all of the following notions well-defined. 

Definition 1.1 Let F be any finite field. A convolutional code C C ¥[z]^ with (algebraic) 
parameters (n, k, 6) is a submodule of the form C = imG, where G G F[z]'^^" is a right- 
invertible matrix such that 5 = max{deg7 | 7 is a fc-minor of G}. We call G a generator 
matrix of the code. The number n is called the length, k is the dimension, and 6 is called 
the overall constraint length of the code. Each right-invertible matrix H G F[z]^""^''^"' 
satisfying C = ker is called a parity check matrix of C. 



Thus, the convolutional codes of length n are the direct summands of F[z]"'. It is worth 
mentioning that a code with overall constraint length zero can be regarded as a block 
code. In the coding literature a right invertible matrix is often called basic p. 730] or 
delay-free and non- catastrophic, see |14( p. 1102]. Sometimes in the literature convolutional 
codes are defined as subspaces of the vector space F((z))" of vector valued Laurent series 
over F, see for instance ^1] and |2j. However, as long as one restricts to right invertible 
generator matrices it does not make a difference whether one works in the context of 
infinite message and codeword sequences or finite ones, see also PUIITU] . 

The most important concept for a code is its distance. It measures the error-correcting 
capability, hence the quality, of the code. The definition of the distance of a convolutional 
code is straightforward. For a polynomial vector v = X^jLo^j-^"' ^ ¥[z]^ the weight is 
defined as wt{v) = X]^o^*(^i)' where wt{vj) denotes the usual Hamming weight of 
Vj G F". Then the (free) distance of a code C C F[2;]" with generator matrix G G F[z]'^^" 
is given as 

dist(C) := min{wt(w) \ v eC, v ^ 0} = mm {wt(nG) | u G ¥[zf, u / O}. 

Just like for block codes there exist quite some bounds on the distance of convolutional 
codes. One of them is the generalized Singleton bound [21\ Thm. 2.2]. It states that the 
distance d of a code with parameters (n, k, 6) over any field satisfies 



d<S{n,k,6):={n-k)(^ ^ +l)+5 + l. (1.1) 



Notice that S{n,k,0) = n — k + 1 which is the well-known Singleton bound for block codes. 
Like for block codes we call a code C with dist(C) = S{n, k, 6) an MDS code (maximum 
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distance separable), see j2H Def. 2.5]. Observe also that can easily be seen if A: = 1. 
Indeed, S{n,l,6) = n{5 + 1) is clear since in this case each generator matrix, being a 
codeword itself, obviously has weight at most n{5 + 1). 



2 A class of one-dimensional MDS codes 



In this section we present a construction of one-dimensional MDS convolutional codes. 
The distance will be computed straightforwardly. We will then compare our results with 
constructions known from the literature. Thereafter we will also compute the extended 
row distances. We will derive that they are increasing with a slope of n — 5. 

Theorem 2.1 Let n € N and < 5 < n — 1 and let q he a prime power such that 
n < q — 1. Put ¥ := ¥q and choose an element a G F such that ord(a) > n. Define 

5 

G:=Y^z''{l a" ... a("-i)'^) G F[z]i^" (2.1) 

and let C := imG C F[2;]". Then G is right invertible, i. e., the submodule C is a 
convolutional code, and dist(C) = n{5 + 1). In other words, C is an MDS code with 
parameters (n, 1,6). 



Notice that for 6 = the code is simply the n-fold repetition (block) code over F and the 
assertions are obvious. 

Proof: In order to show that G is right invertible, we have to prove that the entries of 
the matrix G are coprime. In other words, it needs to be proven that the polynomials 



u=0 



u=0 



i/=0 



have no common root in any extension field F of F. In order to see this, assume /? G F 
is such a common root. Then /3, a/3, . . . ,a"~^/3 are roots of Ylt=o'^'^- Since (3^0 and 
ord(a) > n, these numbers are pairwise different and 6 < n leads to a contradiction. 



Next we will prove that dist(C) = n{6+l). To this end put Gy := (1, a'^ , a 



2u 



, a 



{n--l)iy 



for u 

uG =: 



Q,...,6. Let u = Yli=o^i^^ ^ ^N) where t > and 7^ 7^ ut, and put 



Z]f=o "Vi^^ ■ Defining Gy := for 1/ < and u > 6, we have 



{uq, . . . ,ut)Gu, where G^ 



for = 0, . . . , 6-\-t. Notice that for v < 6 the first row, G^, of Gy is nonzero while for v >t 
the last row, G^-t, is nonzero. Since uq ^ ^ ut this will provide us with a good estimate 
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of the weight of Vi, for these indices. In order to see this, note that for each index u the 
nonzero rows of are consecutive and form a matrix of the type 



/I a'+'- a2(s+r) ... ^(„_l)(s+r)\ 



R :-- 



1 a^+l a2{s+l) ... 



where 0<s<s + r<6. Since ord(a;) > n > 5, the block code imi? C F" is MDS, that 



IS, 



dist(im ii) = n — r. 



(2.2) 



This will now be used for counting the weight of the vectors Vy for u € {0, . . . , 5, t, . . . , 5+i}. 
1. case: t> 5 

In this case the indices 0, . . . , 5, t, . . . , (5 + t are all different and we have 

/ \ 



Go 




for J/ = 0, . . . , (5 and G, 





Gs 
Gs-i 



for fj. = t, . . . , 6 + t 



(2.3) 



V / \G,^tJ 
and all displayed rows Ge are nonzero. Thus, using (|2.2|) . 

wt(t>,y) > n — 1/ for 1/ = 0, . . . , 5 and wt(f^) > n — {6 + t — n) for fi = t, . . . ,6 + t, (2.4) 
and therefore 

wt{v) > 2(n + (n - 1) + . . . + (n - 5)) = 2n((5 + 1) - 6{6 + 1) > n((5 + 1) (2.5) 

where the last inequality follows from 5 < n. 
2. case: t < S 

In this case we consider the indices 0, . . . , 6, 6 + 1, . . . , 6 + t. For u = 0, . . . ,t and for 
fi = 5 + 1, . . . ,6 + t the matrices G^, and are as in H2.3|) . while for A = t + 1, . . . , (5 we 
have 

/ Ga \ 

\Gx-tJ 

and, again, all block rows Ge of Ga are nonzero. From this and H2.2|) we obtain 

wt{v) > {n+{n-l) + . . . + {n-t))+{5-t){n-t) + {{n-t+l) + {n-t+2) + . . .+n) 

t-i 

= 2('tn - ^ + ((5 - t + l)(n - t) = n{6 + 1) + t(n - 5) > n{6 + 1). (2.6) 



i=0 
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This concludes the proof. 



□ 



The proof above also shows that uG with u € F'^yjO}, i. e., the nonzero constant multiples 
of G, are the only codewords having weight n{5 + 1). Indeed, the inequality in (|2.5|) is 
always strict and the last inequality in H2.6|) is strict for all t > 0. 

Remark 2.2 It is not hard to see that the matrix G in (|2.H) is also right-invertible for 
all (5 > n for which ord(a) \ 5 + 1. Examples show that these codes often have a large 
distance, too, but are not MDS in general. However, we can not provide any general result 
in this case. 

We would like to comment on the field size required for the construction of the MDS 
codes in Theorem 12.11 In 11, Lemma 1] and Thm. 3.7] it has been shown that if C 
is an (n, 1, 5)-MDS code over Fg then q> 5 + 1. In Theorem 12.11 the field size q satisfies 
q>n + \>5 + 2. Thus, in the case n = ord(a) = q — 1 and 5 = n — \ our field size 
is just one above the lower bound given above. As to our knowledge it is not known in 
general whether there exist (n, 1, n — 1)-MDS codes over F„ (in the case where n is a prime 
power). 

We also would like to compare our results with previous constructions of MDS codes. 
In |llj MDS codes with parameters (n, 1,(5) for certain combinations have been con- 
structed. However, these combinations are different from ours. For instance, the result in 
jlll Thm. p. 580] does not contain the case of (g — 1, 1, g — 2)-MDS codes over Fg and no 
(g — 1, 1, g — 3)-MDS codes over Fg where g > 5. On the other hand, the construction of 
that theorem allows the construction of a (17, 1,20)-MDS code over F32 which is not part 
of our Theorem 12.11 In [221 a construction of (n, 1,(5)-MDS codes is given over fields Fg 
where q > 5n+l. Except for the case 6 = 1 this is a considerably bigger field size than ours 
where q > n + 1. However, the construction in [22j works for all 5 and not just for 6 < n. 
Another construction of MDS codes is given in j^. Therein, MDS codes with (almost) 
arbitrary parameters {n,k,6) are constructed over fields ¥g of size q > ^^J^^^ + 2. The 
construction is based on cyclic block codes with large distance. In the case k = 1 this again 
amounts to a considerably bigger field than in our construction. Alternatively, one can 
also see directly that our codes are not derived from good cyclic block codes in the sense 
of H2]) i- e., the polynomial g = Yl^=i z^~^Gj{z"') derived from G = (Gi(z), . . . , Gn{z)) 
does not generate a good cyclic block code in general. 

We want to go into more details about the weight distribution of these codes and therefore 
give also lower bounds for the extended row distances. The extended row distances have 
been introduced in jl2l p. 541] and are very closely related to the trellis structure of the 
code and thus to its performance. Details on the importance of these distance parameters 
can be found in J2]^- The jth extended row distance amounts to the minimum weight of 
all paths through the state diagram starting at the zero state and which reach the zero 
state after exactly j steps for the first time. In other words, it is the minimum weight of 
all atomic codewords of degree j — 1 (i. e., length j) in the sense of ^^1- The details are 
also explained in [HI Sec. 3.10]. In our case where the dimension of the code is A; = 1, the 

'^The row distances, as defined in |S1 p. 114] do not give any further information. They are all equal to 
the free distance n{5 + 1). 
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atomic codewords are easily described. We will confine ourselves to the following property. 
It follows readily from the fact that the last 6 coefficients of the message u € ¥[z] make 
up the current state in the state diagram. 

Lemma 2.3 Let G G F[z]-'^^" be a right-invertible generator matrix of the code C := 
imG C F[z]" and let G have overaU constraint length 6 > 0. Let u € F[z]. Then the 
following are equivalent. 

(i) The codeword uG is atomic (i. e., the associated path through the state diagram does 
not pass through the zero state except for its starting and end point). 

(a) The polynomial u (z¥[z] does not have 6 consecutive zero coefficients. 

Having this property in mind, the j'th extended row distance of the code C = imG is 
defined to be 

"rr • r ^\ € Ffz], no / 0, degu = j — (5 — 1, 1 „ n - ^ r 

< := mm <^ wt(uG) . ■ \ r ^ for all 7 > (5 + 1. 

■' I no consecutive coefficients of u are zero J 

Notice that deg(n) = j — 5 — 1 implies deg(nG) = j — 1 and thus the associated path 
has length j. As for the index notation we diverge somewhat from the paper ^| where 
the index j equals the degree of the associated codewords while in our case it refiects the 
length. 

Proposition 2.4 Let C = imG C F[2:]" be the code described in Theorem \2.1\ Then the 
extended row distances satisfy 

d]>{n- 5)j + 6{6 + 1) for all j>6 + l. 

Hence the extended row distances are bounded from below by a linear function with slope 
n — 6. 

Before we prove this result we wish to mention that in a certain sense this result is the best 
one can expect. As Equation (|2.7j) below shows, the "middle" coefficients of a codeword 
are contained in an (n, 6 + l)-block code. The distance of this code is therefore a lower 
bound for the slope. In our case this code has optimum distance n — 6, therefore the 
weight of codewords increases at least linearly in the length with slope n — 6. However, 
in specific cases certain constellations of consecutive coefficients of the generator matrix 
might even allow a better row distance. After the proof we will present examples for both 
cases, where the estimate in Proposition 12.41 is actually an identity and where it is a strict 
inequality. 

Proof: Let u € ¥[z] and degu = j — 6 — 1>0. Then uG =: v = Yli=o ^j-^* degree 
j — 1 and length j. 

lfj-6-l<S, then (ESj) shows wt{v) > n{6 + 1) + {j - S - l){n - S) = 6{S + 1) + j{n - 6). 
Letnowj-(5-l > 6. From (fTll) we have wt( V^? q v,z^ + T^~^ . ^ v.z"^ > {2n-6){6+l). 
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Thus it remains to consider the coefficients Vi where i = 5 + l,...,j — 5 — 2. Since 



Vi 



/Go\ 
\GsJ 



(2.7) 



and V is atomic, the vector {ui,Ui-i, . . . ,Ui-s) is nonzero by Lemma IT3l Thus wt(i'i) > 
n — 6 hy l\'2.2\\ and we obtain 



wt{v) > {2n -5)i5 + l) + (j -26 -2){n- 5) = {n- 5)j + 6{6 + 1). 



□ 



In the following examples we consider various cases of the parameters n and 5 in Theo- 
rem 123 We computed the exact weight distribution (see |^ Sec. 3]) of the codes using 
Maple. 

Example 2.5 

(1) Let n = 3, (5 = 1 and F = F4 = {0, l,a,a^} where = a + 1. Consider G as in 
Theorem 12.11 that is 

G = {1 + z 1 + az 1 + a^z) . 

In this case one can show that the weight distribution is given by (see, e. g., [HI 
Sec. 3.10] and 15 ) 

00 

A{L, W) = SL'^W^il - 3LW^) = ^ 

meaning that all atomic codewords of length j have weight 2 + 2j and that there 
exist 3-'^^ of such codewords for each j > 2. As a consequence, the estimate in 
Proposition 12.41 is an equality, i. e., dj = 2 + 2j. One can also present explicitly an 
atomic codeword of length j and weight 2 + 2j. Indeed, it can be shown directly that 
wt((Ei=o = 2 + 2j for each j > 2. 

(2) Let 6 = 1, char(F) = 2 and n be arbitrary. Then it is easy to see 

2 + j{n — 1) for each j > 2, hence the estimate in Proposition 12.41 is an identity. 

(3) Let 6 = 2 and, again, n = 3, F = F4. Then G as defined in Theorem 12.11 is given by 

G={l + z + z^ l + az + a^z"^ l + a^z + az^). 
In this case the weight distribution is 

A(L, W) = 3T^^l3(1 + 2LW - 2LW^)/{6L^W^-6L^W^-3L'^W^-2LW^-LW + 1) 
= 3W^L^ + 9W^^L^ + (9VF" + 18iy^^ + 9W^^)L^ + 0{L^), 

meaning, for instance, that there are 36 atomic codewords of length five, 9 of which 
have weight 11 and 14, respectively, and 18 have weight 13. Using induction it is 
easy to see that in the series expansion for each j > 3 the coefficient of U is divisible 
by W^~^^ but not by W"^^^ . Hence, dj = 6 + j, and, like in (2), the estimate in 
Proposition 12.41 is an equality. Again, in this case one has wt((^Q z')G) = 6 + j for 
each j > 3. 
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(4) In general however, the inequahty for the jth extended row distance is not an equaUty 
and the growth rate can even be better. This happens for instance for n = 3, 5 = 2 
and F = Fg with, of course, ord(a) = 7. In this case the weight distribution of the 
code in Theorem 12.11 is 

A{L, W) = 7W^L^ + (21VF^° + 28VF^2)L^ + {UW'^'^ + 126W'^^ + U7W''^ + 105W^^)L^ 
+ {91W^^ + . . .)L^ + {63W^^ + . . .)L^ + (281^^'' + . . .)L^ 
+ (2SW^^ + . . + (15^1^ + . . + (561^20 ^ ^ _ -^jii 
+ (561^21 ^ ^ _ _)^12 ^ ^ _ ^ ^^^13 ^ ^ ^ _ _^^14 ^ o(^i5) 

where each sum "+ ..." is meant to contain only higher powers of W . This shows that 
the weight distribution is even better than the lower bound given in Proposition 12.41 
At least for small j we have dJ- > j + 5{6 + 1) = j + 6. 



3 Parity check matrices with Vandermonde structure 



In this section we will derive two types of parity check matrices for the codes of Theorem l2.1l 
in the case where ord(a) = n = 6 + 1, one of them being minimal in the sense of .'5 p. 459]. 
Both reveal a type of Vandermonde structure for these codes. 

Theorem 3.1 Let ord(a) = n and consider the matrix 

,2 



H :-- 



{Z - q")2 



{z - q"-1) 



z — a 

n-l\2 



\{z - a")"~i {z - a^-^Y 



z — a 

(z-q2)2 

{z - a2)"-i 



z — a \ 
iz - a)' 

{z - a)"-7 



G ¥[z 



(n— l)xn 



Then 

(1) H is right-invertihle, 

(2) GH^ = where G = ^"^zl z" (l 
Hence, in the case where ord(a^ 



n = 5 + 1, the code given in Theorem \2.1\ has parity 



check matrix H. 



The condition ord(a) = n is necessary for the theorem to be true. As can easily be checked 
the product GH^ is, in general, not zero if ord(a) > n. 

Proof: (1) For j = 1, . . . ,n let H'^^'i € F[z]("~^)^("~^) be the submatrix of H obtained 
by omitting the jth column. Then, due to the Vandermonde structure of H, we obtain 

n n n n 

deti/(^) = JJ (z-a'^) J] W {z-a^'-z + a'')=W{z-a'') JJ {a^'-a^'). 

hf — l 1^ — 1 fi — v'-\-l 1 l<;y</^<n 
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Since ord(a) = n, the last factor is nonzero for each j. But then the first factors show the 
coprimeness of the maximal minors of H, and thus H is right-invertible |14l Thm. A.l]. 
(2) Let G be given as above. Then for j = 1, . . . , n the jth entry Gj is of the form 



a^-'^z - 1 aJ-iz - 1 z- 0^-3+^ ' 



where for the last equality we used ord(a) = n. Thus, 



G=(a^- ^, a"-i^ -^,...,a- -). (3.1) 

\ z — z — ^ z — a 



Now we can prove GH^ = 0. For easier indexing we will write down the sums of the 
matrix product backwards. Then we have to show that 



" , - 1 



y a" (z - ay = for j = 1, . . . , n - 1. 

^-^ z — a" 

This is equivalent to 

n 

Ya^'iz-a'^y ={)ioT j = 0,...,n-2. (3.2) 

i/=i 

In order to see this, compute 

n n j y .s / A " 

J2 (y^i^ - (y^y = ( ) = ^^'^ I ^ ) (-1)'' Z 

Notice that for fixed /x = 0, . . . , j we have /x + 1 G {1, . . . , j + 1} and j < n — 2 yields 
I |-Q ord(Q;) = n. Therefore, 

" (/t+l)n 1 

^ a^(-+^ = a^(-+i) = _7 = for all ;x = 0, . . . , J. (3.3) 

This proves the Equations (|3.2j) and thus GH^ = 0. □ 

One should notice that the parity check matrix H is highly non- minimal, i. e., it is not a 
minimal basis for the dual code (for the notion of minimal basis see |31 p. 459] or |2l 
Sec. 2.5]). Obviously the leading coefficient matrix is the all-l-matrix and thus has rank 1 
only. This implies non-minimality of by [HI Main Thm.]. A minimal parity check matrix 
will be presented at the end of this section. 

The reader will have noticed that we did not make use of the Vandermonde parity check 
matrix H when computing the distances of the codes in the last section. As to our 
knowledge no theoretical result is known yet about the distances of convolutional codes 
with Vandermonde generator or parity check matrices. As an indication that such a 



10 



relation is not obvious, we would like to mention that Vandermonde parity check matrices 
of the form 



H :-- 



(z - a")'''+i (z - a'''^Y+^ 



{z - a'^Y {z - aY \ 
{z-ay+^ {z-aY^^ 



\ (z-a")" [z-a^'-^Y ■■■ {z-ay {z - aY J 

with 0<r<s<n— 1, i. e., with fewer rows than the matrix in Theorem 13.11 do not in 
general lead to good codes, even if ord(a) = n. This can easily be seen by running a few 
examples using, for instance, Maple. However, one should also notice the close relation of 
these matrices to those appearing in ^ Exa. 4.1]. In that paper methods from algebraic 
geometry are used to construct convolutional codes of Goppa type. A few examples of 
such matrices, but with different linear factors in the entries, are presented in which are 
generator matrices of MDS convolutional codes. A deeper understanding as to whether 
there is a relation between the distance of the codes ker or im H and the Vandermonde 
structure of H must be considered as one of the main tasks in algebraic convolutional 
coding theory. It might also have some impact on the possibility of algebraic decoding of 
these codes. 

Remark 3.2 With completely different methods it is possible to prove that also in the 
general case < J < n = ord(Q), the codes from Theorem 12.11 have a parity check matrix 
of a (somewhat modified) Vandermonde type. Indeed, in that case such a matrix is given 
by 



/ 



H 



z 

[z- 



- a 



a. 

2 



a 
z 

[z- 



n-5-l 



a 



2(n-l) 



\ 



a 



(n-<5-l)(n-l) 



n-l\2 



\(z-a")^ {z 



a 



a 



z 

(z- 



1\5 



- a 
aY 



Hence H is right invertible and satisfies GH^ 



(z-aY I 
0. The proof of this statement needs more 



detailed methods from the theory of cyclic convolutional codes as derived in (Tj and will 
be omitted. 



At the end of this section we will return to the case where b = n — \ and present a minimal 
parity check matrix, i. e., a right-invertible matrix with minimal row degrees in the sense 
of [SI p. 459] or [nj Sec. 2.5]. It shows that the dual code of imG has Forney index 1 
(counted {n — 1) times)^ and thus is a compact code in the sense of |141 Cor. 4.3]. 

Theorem 3.3 Let again ord(a) = n and define 



^The Forney indices of a code are defined to be the row degrees of a minimal generator matrix, see |14l 
p. 1081]. 
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Then Hmin is minimal and right-invertible and G H^^^ = 0, where G is again as in 
Theorem \3. 1^ 2). Hence in the case where ord(a) = n = 5 + 1 the matrix Hmin is a parity 
check matrix of the code given in Theorem \2.1i 

Proof: We use again the representation (|3.H) for the matrix G. Writing down the sums 
of the product GHJ^-^ backwards again we obtain 

But the last expression is zero for all j = 1, ... ,n — 1 as we have shown in (|3.3j) . From 
this we obtain that im Hmin ^ kerC"'" = (imG)^. Hence Hmin = BH, where If is a 
parity check matrix of the code imG and B is some polynomial matrix. By Thm. 3] 
the overall constraint length of imf^^ is n — 1, too. On the other hand it is seen directly, 
that, firstly, the matrix Hmin has full row rank k and that, secondly, the overall constraint 
length of im Hmin is the sum of the row degrees of Hmin because the highest coefficient 
matrix of Hmin is a Vandermonde matrix with full row rank, see also [31 p. 495]. Hence 
Hmin is minimal and both matrices Hmin and H have overall constraint length n — 1. This 
shows that det(i?) £ IP'\{0} and thus Hmin is right-invertible, too. □ 

Notice that Hmin = Hiz — Hq where both Hi and Hq have Vandermonde structure. It 
is worth mentioning that the dual codes, i. e., the codes generated by H or Hmin, are in 
general not optimal, that is, they are not MDS (this can be checked by a few examples). 
They do not even attain in general the Griesmer bound, see [HI Thm. 3.22] or [SJ Thm. 3.4] 
for the non-binary case. We also wish to point out the slight similarity of our construction 
with that in [171 pp. 445]. Therein, an MDS code with parity check matrix of the form 
Hiz + Hq, where Hq, Hi are Vandermonde matrices, is presented. However, in that 
construction the code has large dimension k > ^ while in our case k = 1. 



4 Cyclicity 



In this section we will show that for positive overall constraint length the codes given 
in Theorem 12.11 are cyclic if and only if ord(a) = n. Cyclic convolutional codes have 
been studied in detail in [7]. The first investigations in this direction have been made 
in the seventies by Piret ^Hl and Roos ^HJ- In both papers it has been shown (with 
different methods and in different contexts) that cyclicity of convolutional codes must not 
be understood in the usual sense, i. e. invariance under the cyclic shift, if one wants to go 
beyond the theory of cyclic block codes. As a consequence, Piret suggested a more complex 
notion of cyclicity which then has been further generalized by Roos. In both papers some 
nontrivial examples of cyclic convolutional codes in this new sense are presented along 
with their distances. All this indicates that the new notion of cyclicity seems to be the 
appropriate one in the convolutional case. Recently, in the paper 7 an algebraic theory of 
cyclic convolutional codes has been established which goes well beyond the results of the 
seventies. On the one hand it leads to a nice, yet nontrivial, generalization of the theory 
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of cyclic block codes, on the other hand it gives a very powerful toolbox for constructing 
such codes. We will now give a very brief introduction into cyclicity for convolutional 
codes before investigating this additional structure for the codes of Theorem 12. II 

Just like for cyclic block codes we assume from now on that the length n and the field 
size |F| are coprime. Recall that a block code C C F" is called cyclic if it is invariant under 
the cyclic shift, i. e. 

{vq, . . . ,fn-l) G C =^ {Vn-l,Vo, . . .,Vn-2) S C (4.1) 

for all {vq, . . . , Wn-i) £ F". This is the case if and only if C is an ideal in the quotient ring 
A := F[x]/^^n _i) = {Y. (x" - 1) I /o, ... , fn-i G f}, (4.2) 

i=0 

identified with F" in the canonical way via 

n—l 

p:Y^^A, {vo,...,Vn-i)^Y.''ix\ (4.3) 

1=0 



In order to extend this situation to the convolutional setting, we have to replace the vector 
space F" by the free module F[2:]" 
the ring A by the polynomial ring 



space F"' by the free module F[2:]" := {"^^^q z'^Vi, \ N G No, v,^ E F"} and, consequently. 



z 





iV G Nn, a,, G ^ 



over A. Then we can extend the mapping p above coefhcientwise to polynomials, thus 
^QZ^pivu) where, of course, v^, G F" and thus p{vu) G A for all i^. 
At this point it is quite natural to declare a convolutional code C C F[z]" cyclic if it 
is invariant under the cyclic shift, i. e., if (|4.H) holds true for all (vq, . . . ,Vn-i) G F[z]"'. 
Since, just like for block codes, the cyclic shift in ¥[z]'^ corresponds to multiplication by x 
in A[z], this amounts to the same as saying that C is called cyclic if p(C) is an ideal in A[z]. 
However, it has been shown in |161 Thm. 3.12] and |181 Thm. 6] that each convolutional 
code that is cyclic in this sense has overall constraint length zero, thus is a block code. An 
elementary proof can be found at [3 Prop. 2.7]. Due to this result Piret jE] introduced 
a different notion of cyclicity for convolutional codes which then was further generalized 
by Roos [IH]. This concept is based on some automorphism of the F-algebra A. Thus, let 
Autr{A) be the group of all F-automorphisms on A. It is clear that each automorphism 
a G AutF(^) is uniquely determined by the single value a{x) G A, but not every choice 
for cr(x) determines an automorphism on A. 

The main idea of Piret was to impose a new ring structure on A[z] and to declare a code 
cyclic if it is a left ideal with respect to that ring structure. The new structure is in general 
non-commutative and based on an (arbitrarily chosen) F-automorphism on A. In detail, 
this looks as follows. 
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Definition 4.1 Let a G AutF(A). 

(1) On the set A[z\ we define addition as usual and multiplication via 

az = z(j{a) for all a e A (4.4) 

along witli associativity and distributivity, wfiere multiplication inside A is defined as 
usual. This turns A[z\ into a (non-commutative) ring which is denoted by A[z]a\. 

(2) Consider the mapping 

N N 
p:F[z]"^^[z;a], J^z-^,^ J]z-p(^;,) 

where p : ^ ^4 is as in H4.3|) . A submodule C C F[2;]" is said to be a-cyclic if p(C) 
is a left ideal in ^[z; a]. 

A few comments are in order. First notice that, unless a is the identity, the indeterminate z 
does not commute with its coefficients. Due to this very specific non-commutativity the 
ring A[z;(t] is also called a skew-polynomial ring. Since (t|f = idp, the classical polynomial 
ring ¥[z] is a commutative subring of yl[z;(T], too. As a consequence, ^[z;(t] is a left 
and right F[2;]-module and it can easily be seen that the mapping p in (2) above is an 
isomorphism of left F[z]-modules. In the special case where a = idA the ring A[z;(t] is 
the classical commutative polynomial ring and due to the results mentioned earlier this 
does not result in any convolutional codes other than block codes. In many cases where a 
is not the identity there do indeed exist cyclic convolutional codes with positive overall 
constraint length. Characterizations along with several examples (actually all optimal 
with respect to their free distances) have been presented in [TjEll^. Another class of such 
codes is given by some of the codes of Theorem 12.11 Indeed, we have the following. 

Proposition 4.2 Let n be a positive integer coprime with |F| and let a & ¥ be such 
that ord(a) = n. Furthermore, let 5 E Nq and G be as in p.lj) . Then the ¥-algebra 
honiomorphism a : A — > A defined by cr{x) = ax is an ¥ -automorphism on A and the 
submodule C = imG is a-cyclic. 

In particular, if 5 < n, then C is a cyclic MDS convolutional code. 

Remember that only for specific values of 5 these submodules are actually convolutional 
codes, i. e., the matrix G is right-invertible, see also Remark 12.21 Recall also from the 
last section that in the case ord(a) = n > 5 the codes can be described by a certain 
Vandermonde parity check matrix. Therefore, in this case the codes have quite a rich 
structure. 

Proof: First of all, since (ax)*,i = 0, ...,n — 1 are linearly independent over F and 
(ax)"' = 1, the mapping a as defined above is indeed an automorphism on A. Consider 
now the submodule C = imG where G = Ylt=o^'^ '^'^ • ■ ■ a^"'"^^'')- We have 

to prove that p{C) is a left ideal in A[z;a]. Since p(C) is a left F[2;]-module, it suffices to 
show that p(C) is closed with respect to left multiplication by x. Thus, consider the image 
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of G under the mapping p, i. e., define 

(5 n-1 



g := p(G) = Y.^"Y. ^ ^[^; ^] ■ (4-5) 

Now it suffices to show that )p~^{xg) € imG, or, stated differently, that xg is a left F[z]- 
multiple of g. But this can easily be seen since 

<5 n— 1 5 n—1 S n—l 

!^=0 j=0 !^=0 i=0 i'=0 j=0 



This result proves in particular that if ord(Q) = n, then the codes in Theorem 12.11 are 
cyclic. In the sequel we want to show even more. Indeed, we will prove that the condition 
ord(a) = n is even necessary and sufficient for the codes of Theorem 12. II to be cyclic with 
respect to some F-automorphism. 

To this end we need some more details about the coefficient ring A. Due to the coprimeness 
of n and |F|, this ring is a direct product of fields. Indeed, let 

- 1 = TTi • . . . • vr^, (4.6) 

where tti, . . . ,TTr € F[x] are irreducible, monic, and pairwise different. Then the Chinese 
Remainder Theorem tells us that 

■0 : A — > Ki X ... X Kr, a 1 — > (a mod tti, . . . , o mod Tir) , (4-7) 

where = ¥[x]/ ^^^y is an isomorphism if x^^iKi is endowed with componentwise 
addition and multiplication. Notice that Ki^. = Ki if and only if deg^, vr^ = deg^jTr/. The 
elements 

^(fc) :=0-i(o, ...,0,1,0,..., 0) for /c = l,...,r (4.8) 

(where and 1 have to be understood as the elements Omod tti and Imod vr/ in Ki) are 
particularly important since they form the uniquely determined set of primitive idempo- 
tents in A. The idempotents are pairwise orthogonal, thus = for k ^ I. Observe 

that for any a ^ A the products e^'^a single out the various components of a. Precisely, 
ip^e^^^a) = (0, . . . , 0, amod vr;, 0, . . . , 0) for any / = 1, . . . , r. Therefore, e^^^a + . . . + e^^^o 
is a decomposition of a G A just like the one in (|4.7j) and in the sequel we will use this 
representation rather than that from 1)4. 7() . 

It is straightforward to see that a given automorphism a S AutF(74) induces a permutation 
on the set of primitive idempotents. More precisely, 

a{£^^'>) = e^') for some / such that deg^, iTk = deg^ vrj. (4.9) 



The following example will be important for our purposes. 



15 



Example 4.3 Let a G F be such that ord(a) = n. Then — 1 decomposes mto hnear 
factors, precisely, 

- 1 = (x - l){x - a) • . . . • (x - a"-^). (4.10) 

Along with the n irreducible factors, we also have n idempotents. We will denote them by 
e(°\ . . . ,e(""^). Due to they have to satisfy e'^'^^a'') = 6k,i for all A:, i = 0, . . . , n - 1. 
Thus, the idempotents are of the form 

n-l 

e^''^ = 7fc - a*) for some 7^ G F*, A; = 0, . . . , n - 1. 

In particular, e(°) = 70 ^^J"/ = 7o Y17=o ^^'^^ (^) ~ shows that 70 = which indeed 

exists in F* since n and |F| are coprime. Consider now the automorphism a G AutiF(^) 
defined by a{x) = ax, see Proposition 14.21 Then, using ord(a) = n, we obtain 

n— 1 n— 1 n—1 

1=0 i=0 1=0 

i^k ij^k i^k — 1 

Since a{£^^^) is one of the idempotents again, see (|4.9() . it follows a{e'^^'>) = £^^^^^ for 
k = 0, ... ,n — 1, where we take exponents modulo n. In particular, a'^{e^^^) = e^'"'^^\ 
Hence a induces the permutation with cycle notation 

(^e(n-l),£(n-2)^...^e(l)^^(0)^_ (4.11) 

Using a^{x) = a^x we obtain from the above 

n— 1 

^(n-v) ^ = e{0)(a^^) = - y (a^x)^ for > 0. 

n ^ 

«=o 

This shows that the polynomial g in (|4.5() satisfies 

<5 5 5 

l/=0 !/=0 V=Q 

We will make use of this representation later on. 



Now we can prove that the codes in Theorem 12 . 1 1 are cyclic if and only if ord(a) = n. Just 
like in Proposition 14. 21 we will consider arbitrary overall constraint length b. However, the 
case J = needs to be excluded since it gives us, for any order of a, a cyclic (block) code. 
We will make heavy use of the results derived in [7j. 

Theorem 4.4 Let n G N be such, that n and |F| are coprime and let a G F he such that 
ord(a) > n. Moreover, iet (5 G N and put G as in 1)2. 1() . Define C := imC Then C is 
a-cyclic for some a G AutF(A) if and only if ord(a) = n. In this case C is a-cyclic for the 
automorphism a G AutF(^) defined via a{x) = ax. 
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Proof: The if-part as well as the additional statement have been proven in Proposi- 
tion g^l 

"Only-if-part" : Let C = iniG be u-cyclic for some a € AutF(A). Let x" — 1 = ni=i 
be the prime factorization of — 1 such that vri = x — \. Denoting the idempotents by 
e^^\ . . . , e^*"^ we have in particular e^^^ = ^ = ^ YI^Zq x"^- By assumption ord(a) > n 
and we have to show that ord(a) = n, hence that r = n and, up to ordering, tTj = x — q*~^ 
for i = 1, . . . , n. By assumption, p{C) is the left ideal generated by g as given in (|4.5jl . 
Since rankC = 1, the generator matrix G is unique up to a nonzero constant in F. Thus, 
the generator of the left ideal is also unique up to a constant factor and, along with [3 
Cor. 4.13 and Thm. 4.15(b)], this shows that the polynomial g is reduced in the sense of 
[7| Def. 4.9(b)]. Moreover, since the code is one-dimensional we obtain from [7| Thm. 7.13] 
that g = e^'^^g for some k = 1, . . . ,r such that degTr^ = 1. In particular we have go = e^^'^go 
for the constant coefficient go of g. Since (|4.5j) yields go = ne^^'^ and e^'^^e^^) = for k > 1, 
we conclude k = 1, thus g = e^^'^g. Therefore, 

50 + zgi + z^g2 + ... + z^gs = e^^^g^ + za{e'^'^^)gi + zV2(e(i))(^2 + • • • + z^ {e'^'^^)g5 

where g^ is the coefficient of z^ in g. Hence, g,j = a'^{e^^^)gi, for all i' = 0, ... ,6. Moreover, 
since 5 > we have a{e^^^) ^ e*-^^ for otherwise the code would have overall constraint 
length zero, see 01 Lemma 3.4]. Consider now the coefficient g\ = ^"=o^(oa;)*- The 
equation gi = a{e^^^)gi along with a{e^^^) ^ e^^-* and the orthogonality of the idempotents 
implies e'^^^gi = 0. Substituting x = 1, we obtain Y17=o ~ ^- -^^^ then J27=o "*(«— 1) = 
a"" — 1 = which along with the assumption ord(a) > n implies ord(a) = n. □ 

We want to close the paper with yet another representation of the cyclic codes considered 
so far. In [3 Prop. 7.10] it has been shown that a polynomial g G A[z; a] with the property 
g = e'^^^g for some k = 1,. . . ,r generates an ideal that is a convolutional code, i. e., a 
direct summand in the left F[z]-module A[z;(t], if and only ii g = e'^^^u for some unit 
u G A[2;;(t]. More details about this can be found in Prom Example 14.31 we can easily 
derive how such a unit looks like in the case of the equivalent conditions of Theorem 14.41 
if 8 is not too big. 

Proposition 4.5 Let ord(a) = n and let a G Autir(yl) be defined via cr{x) = ax. Fur- 
thermore, let 1 < 6 < n. Let x" — 1 be factored as in (|4.10|l and denote the corresponding 
idempotents by e^^^ , • • • , e^"~"^^ . Tiien the polynomial g from ()4.5() satisfies g = e^^^u where 

u = n{l + ze(""i))(l + ze("-2)) ..... (i + ze("^'^)) 

and u is a unit in A[z;a]. 

Proof: First of all we have for each j = 1, . . . , n 

(1 + ze("-J'))(l - ze("~J')) = (1 - ze("-J'))(l + ze^"-^')) = 1, 

since a(e("-^)) = due to (jHU, and since the idempotents are pairwise orthogonal. 

Thus u is indeed a unit in A[z; a]. Moreover, using the identity a{£^^^) = £^''~^\ see 1)4. llj) . 
and again the orthogonality of the idempotents one can show by induction on 6 that 

n— 1 n—2 n—S 

u = n(^l + z Y £^''^ + z^ Y £^^^ + ••• + •2'' Y for 5= l,...,n, 

k=n—S k=n—5 k=n—S 
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where a sum is zero if the lower index is strictly bigger than the upper one. From this 
and (|4.12|) one can easily see that e^^^ti = g. □ 



Since the element n € F is a unit in A[z;(7] we can summarize the results of Example ESI 
and the previous proposition as follows, see in particular ()4.12|) and also ()4.5() . 

Theorem 4.6 Let ord(a) = n and a € AutiF(A) be such that cr{x) = ax. Let C be as in 
Theorem \2.1l Then 

(a) p(C) is the left ideal in A[z; a] generated by the elenaent 

5 n—1 n—1 5 

u=0 i=0 i=l v=0 

(b) p{C) is the left ideal generated by the element 

5 

e(o) = e(o)(l + + ze^"-^)) • . . . • (1 + ze(«-'5)). 

The representation on the right hand side of (a) justifies to call these codes one-dimensional 
Reed-Solomon convolutional codes. 

In the paper [3] representations of cyclic codes via units like on the right hand side of (b) 
above have been studied in detail. Therein, it has been investigated as to which algebraic 
parameters (field size, dimension, overall constraint length, and Forney indices) can be 
realized by cyclic convolutional codes. In particular, a construction of certain compact 
cyclic convolutional codes (i. e., all Forney indices are the same) has been derived. How- 
ever, no distance results have been obtained in that context. As has been shown in [S] the 
presentation as on the right hand side of (a) seems to be more suitable for a generalization 
to codes of higher dimension with good distance. 



Open Problems 



We have presented a class of one-dimensional convolutional codes with maximum possible 
distance. In the specific case where ord(a) = n these codes are cyclic and have a Vander- 
monde parity check matrix. Without using explicitly Vandermonde matrices, but highly 
the theory of cyclic convolutional codes, first attempts are currently under investigation 
of how to generalize the construction of cyclic convolutional codes with large distance 
to higher dimensions, see 0. In general, we consider it most important to understand 
whether Vandermonde structure of a cyclic convolutional code can be exploited for dis- 
tance computations and algebraic decoding algorithms. We think that the one-dimensional 
cyclic MDS codes with their rich structure as presented in this paper might be a good 
starting point in this regard. 
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